Choosing an Identifying Set of Matching or Conditioning Variables
نویسندگان
چکیده
Political scientists estimate average causal effects with regression or matching techniques, but both techniques require the user to choose a set of matching or conditioning variables. In this paper, we show that the standard advice from both frameworks on how to choose an identifying set of variables is often insufficient and at times misleading. Furthermore, we argue that structural causal models (SCMs) provide an attractive framework to reason about this problem. SCMs provide simple rules for choosing an identifying set of variables given particular causal assumptions. This framework is equivalent to the Neyman-Rubin framework and common adjustment methods—such as those based on generalized linear models—can be analyzed within the framework. Finally, we demonstrate that SCMs allow the specification of causal modeling assumptions in a manner that is compatible with the mechanistic view of causation commonly invoked by political scientists. ∗The authors thank Thomas Richardson for introducing them to literature of graphical causal models, and also thank Neal Beck, Andy Eggers, Jim Greiner, Ben Hansen, Gary King, Judea Pearl, Anton Westveld, two anonymous referees, and the participants of 2007 Summer meetings of the Society of Political Methodology for their helpful comments and suggestions. The usual caveat applies. In addition, Quinn thanks the National Science Foundation (grants SES 03-50613 and BCS 05-27513) and the Center for Advanced Study in the Behavioral Sciences for its hospitality and support. †Department of Government and The Institute for Quantitative Social Sciences Harvard University, 1737 Cambridge Street, Cambridge, MA 02138. [email protected] ‡Department of Government and The Institute for Quantitative Social Sciences Harvard University, 1737 Cambridge Street, Cambridge, MA 02138. kevin [email protected]
منابع مشابه
Using Matching, Instrumental Variables, and Control Functions to Estimate Economic Choice Models
This paper investigates four topics. (1) It examines the different roles played by the propensity score (the probability of selection into treatment) in matching, instrumental variable, and control function methods. (2) It contrasts the roles of exclusion restrictions in matching and selection models. (3) It characterizes the sensitivity of matching to the choice of conditioning variables and d...
متن کاملTowards the Resolution of Coppersmith-Sorkin Conjectures
A k-matching is a set of k elements of a matrix, no two of which belong to the same row or column. The minimum weight k-matching of an m × n matrix C is the k-matching whose entries have the smallest sum. Coppersmith and Sorkin conjectured that if C is generated by choosing each entry independently from the exponential distribution of rate 1, then the expected value of the weight of the minimum...
متن کاملUsing Multiple-Variable Matching to Identify EFL Ecological Sources of Differential Item Functioning
Context is a vague notion with numerous building blocks making language test scores inferences quite convoluted. This study has made use of a model of item responding that has striven to theorize the contextual infrastructure of differential item functioning (DIF) research and help specify the sources of DIF. Two steps were taken in this research: first, to identify DIF by gender grouping via l...
متن کاملEstimating and Identifying Vector Autoregressions Under Diagonality and Block Exogeneity Restrictions
I show how to estimate and identify a large-scale vector autoregression when the variables in a subset of the system are mutually independent after conditioning on a separate set of variables (diagonality), and when the conditioning variables are independent of the former subset (block exogeneity). Least squares estimation is efficient, and restrictions only on the set of common variables are s...
متن کاملIdentifying Useful Variables for Vehicle Braking Using the Adjoint Matrix Approach to the Mahalanobis-Taguchi System
The Mahalanobis Taguchi System (MTS) is a diagnosis and forecasting method for multivariate data. Mahalanobis distance (MD) is a measure based on correlations between the variables and different patterns that can be identified and analyzed with respect to a base or reference group. MTS is of interest because of its reported accuracy in forecasting small, correlated data sets. This is the type o...
متن کامل